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ABSTRACT 

Analytic derivations of the correlation function and the column density distribution 
for neutral hydrogen in the intergalactic medium (IGM) are presented, assuming that 
the non-linear baryonic mass density distribution in the IGM is lognormal. This ansatz 
was used earlier by Bi & Davidsen (1997) to perform ID simulations of lines-of-sight 
and analyse the properties of absorption systems. We have taken a completely ana- 
lytic approach, which allows us to explore a wide region of the parameter space for our 
model. The analytic results have been compared with observations to constrain var- 
ious cosmological and IGM parameters, whenever possible. Two kinds of correlation 
functions are defined : (i) along the line-of-sight (LOS) and (ii) across the transverse 
direction. We find that the effects on the LOS correlation due to change in cosmology 
and the slope of the equation of state of the IGM, 7 are of the same order, which 
means that we cannot constrain both the parameters simultaneously. However, it is 
possible to constrain 7 and its evolution using the observed LOS correlation func- 
tion at different epochs provided one knows the background cosmology. We suggest 
that the constraints on the evolution of 7 obtained using the LOS correlation can be 
used as an independent tool to probe the reionisation history of the universe. From 
the transverse correlation function, we obtain the excess probability, over random, of 
finding two neutral hydrogen overdense regions separated by an angle 9. We find that 
this excess probability is always less than 1 per cent for redshifts greater than 2. Our 
models also reproduce the observed column density distribution for neutral hydrogen 
and the shape of the distribution depends on 7. Our calculations suggest that one can 
rule out 7 > 1.6 for z ~ 2.31 using the column density distribution. However, one 
cannot rule higher values of 7 at higher redshifts. 

Key words: cosmology: large-scale structure of universe, power spectrum - inter- 
galactic medium - quasars: absorption lines 



1 INTRODUCTION 



ing, star formation etc., in the N-body simulations. Because 
of such complications, our understanding of the formation 
of baryonic structures has been limited. 



The nature and evolution of the initial power spectrum of 
density fluctuations could be obtained by studying the dis- 
tribution of objects at different scales and different epochs. 
The formalism for studying the formation of dark matter 
(DM) structures is well established, as they are collisionless 
particles interacting only through gravity, and has been ex- 
tensively studied using the large cosmological N-body simu- 
lations. However, in order to model the evolution of baryonic 
structures like galaxies, groups of galaxies etc. one needs to 
incorporate all the hydrodynamical processes, heating, cool- 



Among the various baryonic structures, the regions 
where one can neglect the star formation are comparatively 
easier to study. Two such areas are (a) low amplitude fluc- 
tuations in the intergalactic medium (IGM), where the star 
formation rate is very low, and (b) the intracluster medium, 
where one studies processes over large scales and thus the 
star formation details can be neglected. Hence considerable 
effort has been given in understanding these two types of 
structures. 



* E-mail: tirth@iucaa.ernet.in 
t E-mail: paddy@iucaa.ernot.in 
t E-mail: anand@iucaa.ernet.in 



The baryonic matter distribution at 2 < 5 is well probed 
through the absorption signatures they produce on the spec- 
tra of the distant QSOs. It is widely believed that while the 
metal lines systems (detected through Mg 11 or C iv dou- 
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blets) seen in the QSO spectra could be associated with 
the halos of the intervening luminous galaxies (Bergeron & 
Boisse 1991; Steidel 1993), most of the low neutral hydrogen 
column density absorption lines (commonly called as 'Lya' 
clouds) are believed to be due to low amplitude baryonic 
fluctuations in the IGM. 

Semi analytical as well as hydrodynamical simulations 
are consistent with the view that the Lya clouds are small 
scale density fluctuations (Bond, Szalay & Silk 1988; Cen 
et al. 1994; Zhang, Arminos & Norman 1995; Hernquist et 
al. 1996; Miralda-Escude et al. 1996; Bi & Davidsen 1997; 
Riediger, Petitjean & Miicket 1998; Theuns, Leonard & Ef- 
stathiou 1998; Theuns et al. 1998; Dave et al. 1999) that 
are naturally expected in any standard structure formation 
models. This idea is supported by the detection and the evo- 
lution of the weak clustering among the Lya clouds in the 
redshift space (Cristiani et al. 1995; Srianand 1997; Khare et 
al. 1997) . Subsequently it is realised that the thermal history 
of the Lya line forming regions depends on (i) epoch of reion- 
isation (equation of state) (ii) rate of photoionisation and 
(iii) adiabatic cooling. One can in principle neglect shocks 
and other processes that are important only in the highly 
non-linear regime. However a simple linear evolution of the 
densities will fail to produce the saturated Lya systems and 
one needs to incorporate non-linearities in the model. 

As a first step, one can model the non-linear evolution 
of the baryonic fluctuations that produce Lya clouds using 
one of the several approximations like: (i) Zeldovich approxi- 
mation (Doroshkevich & Shandarin 1977; McGill 1990; Hui, 
Gnedin & Zhang 1997), (ii) lognormal approximation (Bi 
1993; Gnedin & Hui 1996; Bi & Davidsen 1997). or (iii) 
power law approximation (Bi, Ge & Fang 1995) (strictly 
speaking, the baryonic fluctuations are calculated here us- 
ing the linear theory). In all these cases the baryon density 
is estimated from the DM density by some rule and the 
neutral fraction is estimated by considering the equilibrium 
between the rate of photoionisation due to background ra- 
diation and the rate of recombination estimated from the 
temperature defined through the equation of state. All these 
models depend on various IGM parameters such as intensity 
of the background radiation, equation of state and density- 
averaged temperature as well as the cosmological parameters 
like fim, Ha, etc. 

Observationally the statistical properties of the Lya ab- 
sorption lines are quantified through the column density dis- 
tribution, correlation functions and their dependence on the 
mean redshift. The clustering properties of the Lya absorp- 
tion lines are studied through two point correlation function 
obtained either (a) in the redshift space using the lines de- 
tected along a single line-of-sight (LOS) which we call "LOS 
correlation function" , or (b) among the absorption lines de- 
tected along the lines of sights toward a few closely spaced 
QSOs which we call "transverse correlation function". In 
either case the observed spectra is decomposed into clouds 
using "Voigt" profile fits. Though this process smoothens the 
density field over the width of the lines the average effects 
due to thermal broadening is taken care of by the Voigt pro- 
files. One can also compute the two-point correlation func- 
tion of the observed flux in different pixels. As this process 
does not decompose the actual density fields into cloudlets, 
in order to analyse the data the models should incorporate 
the thermal broadening and blending of contribution from 



different density fluctuations (Croft et al. 1999; McDonald 
et al. 1999). Most of the existing studies concentrate on ob- 
taining constraints on the cosmological parameters using the 
observed statistical properties. Comparatively very less ef- 
fort is directed to understand how the observed quantities 
depend on the physical conditions in the IGM. 

In this work we make a preliminary attempt to investi- 
gate the dependence of the observable quantities on various 
parameters of the models using a simple analytic approach. 
We derive analytic relations for the two-point correlation 
function among the Lya clouds and the column density dis- 
tribution using a lognormal approximation. These equations 
are used with the observed Voigt profile fitted data to get 
constraints on different IGM and cosmological parameters. 
In section ^, we treat the non-linear evolution with a simple 
ansatz proposed by Bi & Davidsen (1997) for the baryonic 
density fluctuations, and derive analytic expressions for the 
correlation function along the LOS and in the transverse 
direction and the column density distribution. The model 
parameters used to obtain various results are discussed in 
section ^. In section ^, we study the correlation function at 
different redshifts for different structure formation models 
and for different values of the IGM parameters such as the 
density averaged temperature and the equation of state. We 
compare some of our results with the existing observational 
data. We also present the results for the column density dis- 
tribution and study its dependence on various cosmological 
and IGM parameters. The results are summarised in section 

I 



2 ANALYTIC MODEL 

The linear density contrast for dark matter in comoving k- 
space, for a particular redshift z, is given by 



5umik,z) = Z)(2:)(5DM(fe, 0), 



(1) 



where D(z) is the linear growth factor for the density con- 
trast, normalised such that D{Q) — 1. If we assume the linear 
density contrast to be a Gaussian random fleld, then the cor- 
responding linearly extrapolated power spectrum PoM{k) is 
deflned by 



(5DM(fe,0)foM(fc',0)) = (27r)^PDM(fc)(5Dirac(fe-fc') 



(2) 



The power spectrum is only a function of the magnitude of 
fc, because of the isotropy of the background universe. 

The linear density contrast for baryons in the IGM can 
be obtained from the DM density contrast by smoothing 
over scales below the Jeans length. We use the relation (Fang 
et al. 1993) 



5B{k,z) = -^—-^ 



where 



Xb{z) = 



1 -Ha;2(z)fc2' 

2-ykBTm{z) 



Ho 



3fimpQ,m{l + z) 



1/2 



(3) 



(4) 



is the Jeans length; Tm and /x are the density-averaged tem- 
perature and mean molecular weight of the IGM respec- 
tively; r2„i is the cosmological density parameter of total 
mass and 7 is the ratio of specific heats. Strictly speaking. 



© 2000 RAS, MNRAS 000, 



Semi analytic approach to understanding the distribution of neutral hydrogen in the universe 3 



equation (^) is valid only for the case where xi, is indepen- 
dent of 2, but it is shown by Bi, Borner & Chu (1992) that 
equation (^) is a good approximation for (5b (fc, z) even when 
Xb has a redshift dependence. The linear density contrast 
in real comoving space, S{x,z), is the Fourier transform of 
equation 

In principle, to study the properties of the IGM one 
has to take into account the non-linearities in the density 
distribution and various physical processes such as shocks, 
radiation field, cooling etc. However, detailed hydrodynam- 
ical modelling of IGM has shown that most of the low col- 
umn density Lyo? absorption (i.e. A^hi < 10^"* cm~^) are 
produced by regions that are either in the linear or in the 
weakly non-linear regime (Cen et al. 1994; Zhang, Anninos 
& Norman 1995; Hernquist et al. 1996; Miralda-Escude et 
al. 1996; Theuns, Leonard & Efstathiou 1998; Theuns et al. 
1998; Dave et al. 1999). The lower envelope of the column 
density, A'^ei versus the thermal velocity dispersion, b (given 
by b = {2kBT/mpY'^) scatter plot (Schaye et al. 1999a; 
Schaye et al. 1999b) suggests that there is a well defined re- 
lationship between the density and the temperature of the 
IGM (Hui & Gnedin 1997). Thus it is possible to model low 
column density systems using simple prescription for the 
non-linear density field and an equation of state. 

In this work, we take into account the effect of non- 
linearity by assuming the number density distribution of the 
baryons, n-aix, z) to be a lognormal random field 



n^{x,z) = Ae*=^<^'"' 



(5) 



where A is a constant to be determined. The mean value of 
n-B{x,z) is given by 

{nB(a;,2))=no(z) = ^(e'^(^-^') (6) 
Since S-b(x, z) is a Gaussian random field, one can write 



where 



A\z) = {5Ux,z))=D'(z) 
Hence, 

A = no(z)e-^'W/^ 
and 

nB{x,z) — no{z) exp[5B{x, z) 



(27r)3 (l + a;2(z)fc2)2' 



(7) 



(8) 



(9) 



(10) 



The lognormal distribution was introduced by Coles & Jones 
(1991) as a model for the non-linear matter distribution in 
the universe. This ansatz has several interesting features: 

(a) It can be seen that the matter density given by equa- 
tion (^^ is always positive, even when 5b — > — oo, unlike 
any polynomial function of 5b- When the density contrast 
is small {5b <C 1), equation ( p^ ) reduces to ne/i-o ~ 1 + &, 
which is just what we expect from linear theory. 

(b) On small scales, equation ( p^ ) becomes the isothermal 
hydrostatic solution, which describes highly clumped struc- 
tures like intracluster gas, ub oc exp(— ^mpi/)DM/7fcsr), 
where ipoM is the dark matter potential (Sarazin & Bah- 
call 1977). The lognormal function can be thought of as 
the simplest function which links these two extreme regions 
smoothly. 



(c) One can also think of the lognormal distribution as the 
kinematic model for the density field. If one assumes that 
the initial density and velocity fields are Gaussian, and ex- 
trapolates the continuity equation into non-linear regimes, 
treating the velocity field as linear, it turns out that the 
non-linear density field obtained in such a manner follows 
the lognormal distribution (Coles & Jones 1991). 

(d) Bi & Davidsen (1997) have tested the distribution 
against hydrodynamical simulations, and found a reason- 
able match between them. The lognormal assumption has 
also been used to model the IGM in numerical simulations 
(Bi 1993; Bi & Davidsen 1997) and is found to be work- 
ing well in reproducing the observations. In particular, the 
simulation results matches well with the observed column 
density distribution and number density of the Lya absorp- 
tion lines, the probability distribution of the b parameter etc 
(see Bi & Davidsen 1997). 

We shall also discuss briefly a more general argument as 
to why the lognormal distribution should be natural choice 
in a large class of phenomena. There is a wide class of quan- 
tities, denoted by /, the time evolution of which can be 
characterised by the following property - the change in the 
value of / at some instant ti is proportional to its value at 
that instant, with the proportionality factor being a random 
variable. In mathematical notation, this can be written as 
/(ii+i) = f{ti) + £if{ti), where £i is the random variable. 
(Some examples of such phenomena in sociological context 
are (i) rich getting richer through fluctuations in stock mar- 
ket, and (ii) more facilities being provided to people who 
already have them.) Similar situation can occur in struc- 
ture formation scenario also. The regions which have high 
density, because of stronger gravitational attraction, have 
a better chance of acquiring more mass. Let us denote the 
density field at some particular point at a given epoch ti by 
n{ti) and postulate the evolution. 



n(ti+i) = n{ti) + £in{ti) = (1 + £i)n{ti) 



(11) 



We can now write n{ti+i) in terms of some initial density 
field 71 (to) 

n{U+i) ^{1+ eO(l + ■ • • (1 + eo)n(io) (12) 
Taking logarithm of both sides 



ln[n(t,+i)] = ln(l -f- e,) + \n[n(to) 



(13) 



3 = 



It is clear that when the time intervals (ti+i — ti) are small, 
the mass acquired within that interval will also be very 
small. Hence we expect that ^ 1. Then the above ex- 
pression becomes 



In 



n{ti. 



n{t()) 



J=0 



(14) 



This means that \n[n(t) / n{to)] is a sum of a large number of 
uncorrelated random variables. Using the central limit theo- 
rem, we can conclude that it follows a Gaussian distribution 
or, equivalently, n{t) follows a lognormal distribution. This 
suggests that it may be reasonable to try an ansatz that 
the distribution of the non-linear baryonic density field is 
lognormal. 
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As an aside, we just mention that our analysis described 
here can easily be carried out for any other local ansatz for 
the non-linear baryonic density. [The results for power law 
assumption in which tib oc (1 + 5Y will be discussed in a 
later paper.] 

Once we have obtained the total baryonic density, the 
fraction of hydrogen in the neutral form, /, in the IGM can 
be obtained by solving the ionisation equilibrium equation 



a{z,T{z))npne = J{z)nm, 



(15) 



where a{z,T{z)) is the radiative recombination rate and 
J(z) is rate of photoionisation for hydrogen at redshift z 
(Black 1981); np,n^ and tihi are the number densities of 
proton, electron and neutral hydrogen, respectively. For sim- 
plicity, we assume that hydrogen is the only element present 
in the IGM and neglect the presence of helium and other 
heavier elements. In such a case, we have rie = rip. (This re- 
lation is not valid in the presence of helium or other heavier 
elements. If we have taken their presence into account, we 
would have got = /trip, where k is a constant. Usually, 
1 < « < 1-2, because the amount of helium and heavier 
elements in the IGM is small compared to hydrogen. Since 
we do not know J[z) beyond an accuracy of 10-20 per cent, 
we can always absorb k into J{z) .) Let us define the neutral 
fraction of hydrogen, / by 

/ = ^ = (16) 



riHi + rip 



Hence we get from equation (|15| 



J{z) 



f a{z,T{z))nB- ^^^^ 

In general, one can solve this equation and determine / as 
a function of ns . This expression simplifies for two extreme 
cases. For / ^ 1, we get 

a{z,T{z))n^ 



f J{z) 
and for / ~ 1, 



/ = 1 



J{z) 



a(z,T{z))nB ' 
Hence, we have 

(-J^nl[.,z) 



(18) 



(19) 



nm[x, z) = 



(if nni < ne) 



n^{x, z) 



(if nm-ne) 



The ionisation conditions in the Lya absorbers are similar 
to the of H II regions with / ~ 10"*. Thus, from now on we 
concentrate only on the case nni ^ riB. 

We take the temperature dependence of the recombina- 
tion coefficient a to be given by (Ranch et al. 1997) 



a{z,T{z)) = ao 



(21) 



where ao = 4.2 x 10~^^ cm"^ s~^. This relation is a good ap- 
proximation for Q in the temperature range relevant for Lya 
forest. The temperature T is related to the baryonic density 
n through the equation of state. We assume a polytropic 
equation of state p cx oc , or equivalently 



T{z) = To{z)[nB{z)/no{z)] 



7-1 



(22) 



where 

no(z) = 



(23) 



Pi, nip 

is the mean baryonic number density at redshift z. pc is the 
critical matter density at the present epoch, given by 



Pc = 1.8791 X 10"^^/i^cm"^ 



(24) 



and ptrrip is the mass per baryonic particle. Then, the H I 
density becomes 



nm{x,z) = F{z) 



where 



nB{x, z) 
no(z) 



F{z) = a,nl{z)[^\ J-\z) 



and 

= 2.7 -0.77. 



(25) 



(26) 



(27) 



(We note, in passing, that /3 becomes negative if 7 > 3.86.) 
We can write the H I density in terms of the linear baryonic 
density contrast 



nm{x,z) = F^_{z) exp[/3(5B(a;, z) 
where 

Fi(2)=F(z)e-''^'W/^ 



(28) 



(29) 



It is clear from equation (g8|) that the H I distribution at 
a particular redshift is also described by a lognormal distri- 
bution. All the statistical quantities regarding H I can be 
derived from this in a straightforward manner. 



2.1 Correlation Function for Neutral Hydrogen 

One of our main interest is the correlation function 

(nHi(a;, 2;)nHi(a;', z')) = 

F^{z)Fi{z'){e^v{P[5B{x,z)+5B{x',z')]}), (30) 

from which several useful quantities can be obtained. Since 
5b is a Gaussian random field, we can write, using equation 

(0) 

(exp{/3[&(x,z) +5B(a;',z')]}) = 



exp[^{A2(z) + ^\z') + 2Q{x, x'- z, z')}] 
where 

Q{x,x';z,z) = {5b{x,z)5b{x' ,z')). 

Simple algebra gives 

Q{x,x' \ z, z') = Q{x — x' ; z, z') = 



D(z)D(z') 



i(fc)e 



(27r)3 {l + xl{z)k^){l + xl{z')k^)' 



One also notes from equations (^) and (H) that A^{z) 
Q{0; z,z). We can now write equation 



{nm{x, z)nm{x', z )) = ^2(2)^2(2 ) 
where 



I^^I3-'Q{X-X 



(31) 
(32) 

(33) 

z) = 

(34) 
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F2{z) = Fi{z)e' 



(35) 



One needs to normalise the quantity 
{nm{x, z)nui{x' , z')) , to obtain the correlation function 
^mix — x'; z, z') for H I. A natural way of normalising the 
correlation would be to use the definition 



{nm(x, z)){niii(x ,z')) 
Since {nm.{x,z)) — ^2(2), we get 

t f I i\ 3^Q{X-x' ■.z,z') 



1 



(36) 



(37) 



with Q given by equation (p3[). 

All the analysis above is valid if one can probe any scale 
with arbitrary accuracy. But it turns out that one cannot 
obtain information about scales smaller than some partic- 
ular value, due to various observational constraints. While 
observing along a LOS, it will be impossible to probe the 
velocity scales less than the spectroscopic limit due to ther- 
mal broadening and the blending of spectral lines. Similarly, 
while observing across the transverse direction, the peculiar 
velocities of individual points will constrain the velocity res- 
olution which we have not taken into account in the above 
analysis. If A« is the smallest scale one can probe, then the 
corresponding limit in the redshift-space is 

A2 = — (1 + 2). (38) 
c 

This means that we will not be able to probe below a co- 
moving length scale given by 



I^x{z) 
where 



dH(z)A2, 



d.H'yz) = C 



-^{^tv + n„(l + zf + + zf\ 



1/2 



(39) 



(40) 
(41) 



This effect can be included in our calculation by smoothing 
over all the length scales smaller than ^x(z), where 



(42) 



is the average redshift. We use a Gaussian window of width 
(Jx [z] = ^x(z), and get a smoothed version of Q in equation 
(p3|). In Fourier space, this smoothing will introduce an extra 
Gaussian term in the integrand, and our smoothed Q will 
be 



Qsmooth ("^ ^ 5 -^5 ) 

d^fc PDM(fc)e- 



D{z)D{A 



l(z)/2jk.(x-x') 



(43) 



(27r)3 {l + xl(z)k^){l + xl{z')k2) 

The angular integrations can be carried out trivially, and we 
get 

_ D{z)D{z') 

^smooth l^*^ X ^ Z f f-»o 

PDM(fc)fc'e-'''"''^')/^ sinfcX 
,g il + xl(z)k^){l+xl{z')k^) kX ' ^ ' 

where X — \x — x'\. The final integration can be done nu- 
merically, once the DM power spectrum is given. 



At this stage, the relations derived above can be used 
for any x, x' , z, z' . As we mentioned earlier, if one observes 
the H I along a particular LOS, then one is probing different 
regions of the IGM at different redshifts. The position x will 
be related to the redshift z by the relation 



x{z) 



dH{z')dz' 



(45) 



where dniz) is given by equation (^). Then the LOS cor- 
relation function is given by 



^^'^^{z,z')^e' 
where 

QhOsHiz, z'); z, z ) 
27r2 



/3^Qlos{'(^.z');2,z') 



(46) 



dk 



PDM(fc)fc'e-'=''''«/2 gijj^; 



(1 -f x2(2)fc2)(l +x2(2')fc2) kl 



(47) 



and 

l{z,z') = x{z) - x{z'). (48) 

It should be stressed that S,ki^{z,z') / ^m^{z - z). This 
means that one cannot rigorously define a power spectrum 
from the LOS correlation function because the correlation is 
a function of two variables z and 2'. In other words, the LOS 
power spectrum does not exist in strict sense. However, one 
can get an approximate LOS power spectrum for a small 
redshift range around any mean redshift. We have already 
defined the average redshift in equation (ji^). We define a 
redshift difference 

Az = 2 - 2' (49) 

and evaluate the correlation function for a particular value 
of 2 as a function of A2, i.e.. 



^Jj°^(2-,A2)=e'' 
where 

QhOs{z, Az) = 

D{z + Az/2)D{z - Az/2) 



(50) 



dfc 



sin(A;ci_H-(2) A2) 



27r2 /„ [ kdH[z)Az 
PDM(fc)fc'e-'°'"^'^''/^ 

^ (1 +x2(2-(- A2/2)A:2)(1 -f x2(2- A2/2)fc2) 

(51) 

For small A2, one can use equation (^) to write the correla- 
tion as a function of Aa;, Fourier transform the correlation, 
and get the power spectrum. Such a power spectrum will de- 
pend on the value of 2. We stress again this power spectrum 
is approximate in the sense that it exists only for A2 <^ z. 

The transverse correlation is observed at some particu- 
lar redshift (2 = 2'), along the transverse direction. Then 



?HI l*±, 2j — e 



(52) 



Qtrans('±; z) 

27r2 



dfc 



PDM(fc)fc'e-'=''^'(^'/2 sinfc/j 



(1 



(2)fc2)2 



klj 



(53) 
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where l± is the comoving distance along the transverse direc- 
tion. For a given redshift, the transverse correlation is only 
a function of Hence, one can obtain the power spectrum 
from 5hT"° following usual methods. 

2.2 Column Density Distribution 

One of the other statistics the observers use to quantify the 
distribution Lya absorption lines is column density distri- 
bution. Indeed one can get the analytic expression for this 
using the formalism developed so far in this work. Note that 
Voigt profile fitting to the absorption lines are used to get the 
observed column density distribution. Here we use a method 
called 'density-peak ansatz' (DPA), discussed in Gnedin & 
Hui (1996) and Hui, Gnedin & Zhang (1997) to derive an 
analytic expression for the column density distribution. 

Suppose we are looking at the IGM along any one di- 
rection, at some redshift z. Then the linear density field 
5^^\x,z) along that LOS will be described by a one di- 
mensional Gaussian random field. DPA assumes that each 
density peak in the comoving space is associated with an ab- 
sorption line, and one can assign a definite column density 
to each of them. In the articles referred above, each density 
peak is fitted with a Gaussian, and the column density is 
calculated using 

A^Hi oc / nmix) dx. (54) 

J peak 

In such a case, there is a definite correlation between the 
value of the density field at the peak, and the effective width 
of the absorber (which is determined by the correlation be- 
tween the density field and its second derivative at the peak, 
and is fixed once the fitting function for the density peak is 
given). We, however, take a simpler approach in assigning 
the column density to a density peak, which is described 
below. 

The coherence scale of the distribution is defined as 
(Bardeen et al. 1986) 

(72 ' 



The relation between A^'hi and 5^^^\ is then given by 



(55) 



where cti and 02 are defined in equation (see Appendix 
A). This length is a measure of the distance between two 
successive zeroes for the one dimensional Gaussian random 
field. Since this is the relevant scale for the distribution of 
zero-crossing, we expect the effective length scale of a peak 
to be a fraction of R* . Then the column density correspond- 
ing to a particular peak will be 



A'hi oc nHi[peak]R* = nHi[peak]R*e 



(56) 



where nHi[peak] is the H I number density at the peak and 
e is the proportionality constant, which can be used as a 
free parameter in comparing with observations. We have as- 
sumed e to be independent of A''hi, which means that the 
column density is directly proportional to the peak den- 
sity. Using this prescription for obtaining the column density 
from the H I density, we can easily obtain the relation be- 
tween A^Hi and the total baryonic over-density n^/riQ using 
equation (^o|). For the case nni ^ wb, the relation is given 
by 



wb [peak] 
no 



iVni 



1//3 



(57) 



5g ^'[peak] = i In 



13 lR*eF(z 



+ 



(58) 



Given this relation, it is straightforward to obtain the quan- 
tity dA'pk/(d2: dA'ui), defined as the number of clouds 
(peaks) per column density interval per redshift interval. 
For completeness, we give the relevant calculations in the 
appendix. 



3 MODEL PARAMETERS 

In this section we discuss about the various model parame- 
ters used in obtaining the results. The parameters defining 
the model can be divided into two categories : (i) cosmolog- 
ical parameters, and (ii) parameters related to the IGM. 

The first set of parameters are those which determine 
the background cosmology. We assume that the background 
universe is described by the FRW metric. We have consid- 
ered four different cosmological models with the parameters 
listed below: 

SCDM n„ = 1, r^A = 0, ft = 0.5 
OCDM = 0.35, = 0, /i = 0.5 
LCDMl Vim = 0.35, f^A = 0.65, h = 0.5 
LCDM2 Vim = 0.35, t^A = 0.65, h = 0.65 

The next cosmological input that is required is the form of 
the DM power spectrum. We take the following form for 
fbM(fc) (Efstathiou, Bond & White 1992) 

Ak 



(1 + [afc-h (fefc)i-5 + {ckYYy/^ 



(59) 



where v = 1.13, a = (6.4/r)/i"^ Mpc, b = (S.O/r)/!.-^ Mpc, 
c = (1.7/r)/i~^ Mpc and F = Q.mh. The normalisation pa- 
rameter A is fixed through the value of erg (the rms density 
fluctuation in spheres of radius 8 h^^ Mpc). We take the val- 
ues of erg to be given by (Eke, Cole & Frenk 1996) 



(0.52 ± 0.04)!:!-°-^''+°-i''"'" (if ^A = 0) 
- ^ (0.52 ± 0.04)t7-°-52+o-"n. 



' (if Oa = 1- 0™) 



(60) 



The next set of parameters are related to the physical 
conditions in the IGM. The parameters we need to describe 
the IGM are 7, Tm, f^baryon, J(z) and To{z). All these quan- 
tities were defined in section S. Besides these, we have also 
introduced a parameter e while modelling the column den- 
sity distribution of the IGM. This is taken as a free param- 
eter, to be fixed through observations. 

It is known that the value of 7, at any given epoch, 
depends on the reionisation history of the universe (Hui & 
Gnedin 1997). The value of 7 and its evolution is still quite 
uncertain. Using Voigt profile fits to the observed Lya ab- 
sorption lines one can in principle obtain the value of 7. 
Available observations are consistent with 7 in the range 
1.2 < 7 < 1.7 (Schaye et al. 1999b) for 2 < z < 4.5. As far 
as the evolution of 7 is concerned, we shall treat it as inde- 
pendent of z. The density averaged temperature is defined 



/ p{T) T AT 
J P{T) AT 



(61) 
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Using the equation of state T ^ , we get 
27 — 1 

Tfii ('2^max " ^min)- (^^) 

7 

We take r„ to be in the range 10,000 K< T„ <60,000 K. 
The minimum value of Tm corresponds to the minimum tem- 
perature of the IGM, which is determined by the photoion- 
isation equilibrium. The maximum value of Tm corresponds 
to 6 parameter ~ 31.7 km s~^ and is consistent with the 
minimum value of 6 observed at higher H I column densi- 
ties (i.e. 10^*'^ cm~^). The evolution of depends on how 
Tinax and Tniin cvolve with redshift. One possibility is to take 
the adiabaticity relation Tm ~ (1 -I- z)"^^"^. However, people 
have argued that since there is no conclusive evidence of the 
evolution of the temperature in IGM, one should treat the 
mean temperature of the IGM as constant (Bi et al. 1995). 
Hence, in this paper, we also consider the second possibil- 
ity where Tm is independent of z. For the cases where we 
consider a small redshift range Az around a mean redshift 
z, the effect of the evolution of Tm is not very significant. 
But, whenever we study the evolution of a quantity over a 
large redshift range, we have to take into account the various 
redshift dependences of Tm- 

We note that when we normalise the correlation func- 
tion for neutral hydrogen, the parameters f2baryon, J{z) and 
To(«) cancel out. Hence the knowledge of these parame- 
ters are not necessary for modelling the correlation function. 
However, in the case of column density distribution, they ap- 
pear as a combination {Q.\ys,ivoiih^)'^ { z)Tn'^''^ { z) through 
F{z) (equations (^^ and (^^). The values of these quanti- 
ties are not known accurately, nor do we know how J{z) and 
To[z) evolve. In this work, we take J{z) to be independent 
of z. We fix the value of the combination (f2baryonft^)^ to 
be (0.026)^ X 10^"^ s, which is consistent with the values given 
in McDonald et al. (1999). One should note that any change 
in the values of the above parameters can be compensated 
(to some extent) by changing the value of e, which is a free 
parameter in our model for the column density distribution. 
For To evolution, we shall consider two separate cases like 
Tm, i.e., (i) To ~ (H- zf'''^ and (ii) To independent of z. 

As we have discussed earlier, we need to smooth the 
power spectrum below some velocity because the blending of 
spectral lines makes it impossible to resolve the lines below 
a particular velocity. Typically this velocity is of the order of 
a few tens km s~^. For definiteness, we take the smoothing 
velocity to be At; = 30 km s~^. 



4 RESULTS 

4.1 LOS Correlation 

We shall now compute the results for the H I correlation 
function along the LOS as a function of the velocity separa- 
tion V, where v is related to /S.z hj v — cAz/(l -I- z). 

The results for the LOS correlation function for different 
cosmological models are shown in Figure |l| We have chosen 
typical values for Tm as 40,000 K and 7 = 1.5, at a redshift 
oi z = 2.5. It can be seen that the correlation curves tend to 
flatten at low velocities, and goes to zero at high velocities. 

Miralda-Escude et al. (1996) give the correlation func- 
tion of the transmitted flux along a LOS (see the solid curve 
in their Figure 12a). We note that the correlation curves for 



7=1.5, T^ = 40,000K, Av=30 km s"', 
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Figure 1. LOS correlation function as a function of velocity 
separation. Results for four different cosmological models at a 
mean redshift of S = 2.5 are presented. The correlation function 
is normalised to unity at zero velocity separation. 



Table 1. Power law index, p, for LOS correlation function, de- 
fined by 5los('!') <x in the velocity range 100—1000 km 
for z = 2.5. We also give the Icr error in p. 







7 = 1.2 


7 = 1.7 




SCDM 


2.03 ±0.015 


1.78 ±0.004 


Tm = 10, 000 K 


OCDM 


2.29 ±0.036 


1.81 ±0.012 




LCDMl 


1.93 ±0.017 


1.63 ±0.003 




LCDM2 


2.19 ±0.029 


1.83 ±0.008 




SCDM 


1.94 ±0.003 


1.68 ±0.015 


Tm = 60, 000 K 


OCDM 


2.10 ±0.014 


1.62 ±0.011 




LCDMl 


1.83 ±0.005 


1.53 ±0.012 




LCDM2 


2.08 ±0.013 


1.71 ±0.009 



the transmitted flux and the neutral hydrogen density need 
not be exactly the same. However, we expect that the broad 
features and the general trends should be alike. It turns out 
that the correlation curve for transmitted flux obtained from 
the simulations does have the same trend as our results. 

We can compare how the shape of 5los(i') depends on 
various parameters. We found that the ^LOs(t') curve falls 
approximately like a power law, ^los(i') cc , at velocities 
within 100-1000 km s^^. At lower velocities the curve is 
practically independent of v. We have given the value of p 
and the rms error on p for different cosmological models and 
for different IGM parameters in Table 0. 

The dependence of p on the IGM parameters can be 
understood easily. Higher value of Tm implies a larger Xb 
which, in turn, implies more smoothing of the power spec- 
trum at low scales. However, the larger scales are more or 
less unaffected by Xb- Consequently, the correlation curve 
becomes flatter as we increase Tm- Also, for a fixed value of 
7, the effect of the cosmological models on the shape of ^los 
is large for low Tm- 
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The effect of 7 on ^los is twofold - increasing 7 intro- 
duces more smoothing at low scales just like Tm, and there 
is also a reduction in the neutral hydrogen density fluctu- 
ations for given baryonic fluctuations see (equation (|2^)). 
Both these effects make the correlation curve flatter, which 
is what we see from Table |l|. Furthermore, because of this 
twofold effect, 7 affects p much more than Tm does. This 
point can be seen clearly in Figure H In the left figure, we 
have kept x^j constant, and shown the effect of changing only 
the neutral hydrogen density fluctuations. The middle flg- 
ure shows the effect of changing only Tm or, equivalently, the 
Jeans scale Xb, without changing anything else. In the right 
flgure, the full effect of 7 can be seen, as it changes both the 
Jeans scale and the neutral hydrogen density fluctuations. 
Since Tm does not have much effect on p, we can deter- 
mine 7 from observations of LOS correlation even with ill- 
constrained values of T,„, provided the cosmology is known 
from some other studies. 

Just like in the case of Tm , the effect of the cosmological 
models on p is large for low 7. The first reason for this is same 
as in the case of Tm - low 7 implies less smoothing and hence 
DM fluctuations are more effective. The second reason is 
that for low 7, the neutral hydrogen density fluctuations are 
larger for given linear baryonic density fluctuations. Thus, 
a slight change in the DM fluctuations causes a significant 
change in the neutral hydrogen fiuctuations (for a fixed Xb) 
which, in turn, makes the correlation function sensitive to 
the DM power spectrum. 

As the universe evolves after the reionisation, it is pos- 
sible that the value of 7 increases (Hui & Gnedin 1997) 
and the LOS correlation becomes less and less sensitive to 
the DM power spectrum. Hence, if the reionisation has oc- 
curred very early, it is extremely unlikely that one can fix 
the cosmological parameters from observations of ^los- We 
not only need to know the value of 7 and Tm accurately, but 
also the value of p to an accuracy better than 10 per cent. 

Finally, we comment on the effect of h on the shape 
of Clos- As one can clearly see from Table |l|, the effect is 
quite signiflcant (~ 12 per cent). The parameter h affects 
the LOS correlation function (equation dsi])) in three ways 
- (i) it changes the shape of PDM(fc) (see equation (|59|)), (ii) 
it affects the value of Xb (equation M)) and (iii) it affects the 
relation between distance and redshift. As a result, the val- 
ues of ax{z) and duiz) in the integrand of equation (jsi]) get 
modifled. However, it turns out that the last two effects can 
be scaled out. To understand this more clearly, we rewrite 
equation ( jsi) ) in a slightly modifled form 

D{z + Az/2)D{z - A2/2) 



Table 2. Power law index, q, for the evolution of LOS correlation 
function at velocity separation 100 km , defined by 5los(2) <x 
(1 -I- z)~'^ in the redshift range 1.5-4.5. Icr errors are given. 



(5los(2, Az) = 



AK 



27r2 

sm{KDH{z)Az) 
KDh(z)Az 



(1 Xl{z A2/2)7i'2)(i ^ Xl{z - l^z/2)K^ 
where K = k/h and 

Dh{z) = dH{z)h, T,x{z) = ax{z)h, Xb = Xbh 



(63) 



(64) 



One can easily verify that all the three quantities de- 
fined above {DH,'^x,Xb) are independent of h. Hence 
(3los(z, Az) depends on h only through the combination 





Tm ~ (1 - 


+ 2)3^-3 








7 = 1.2 


7 = 1.7 


T„(f = 2.5) 
= 10, 000 K 


SCDM 
OCDM 
LCDMl 
LCDM2 


4.00 ±0.045 
5.69 ±0.066 
4.69 ±0.062 
5.28 ±0.073 


3.12 ±0.025 
4.14 ±0.039 
3.56 ±0.035 
3.91 ±0.042 


T™(f = 2.5) 
= 60, 000 K 


SCDM 
OCDM 
LCDMl 
LCDM2 


3.83 ±0.042 
5.11 ±0.057 
4.44 ± 0.056 
4.96 ± 0.067 


3.38 ±0.024 
4.18 ±0.039 
3.73 ± 0.034 
4.08 ± 0.041 


Tm =constant 






7 = 1.2 


7 = 1.7 


Tm = 10, 000 K 


SCDM 
OCDM 
LCDMl 
LCDM2 


3.97 ±0.045 
5.63 ±0.064 
4.66 ±0.061 
5.25 ±0.072 


2.98 ±0.026 
3.89 ±0.037 
3.42 ± 0.035 
3.75 ±0.042 


Tm = 60, 000 K 


SCDM 
OCDM 
LCDMl 
LCDM2 


3.65 ±0.039 
4.80 ± 0.049 
4.25 ±0.052 
4.73 ±0.061 


2.72 ±0.021 
3.24 ± 0.025 
3.09 ±0.028 
3.33 ±0.033 



P-Dm{Kh). It is very difficult to study this function ana- 
lytically. We have studied it using numerical methods and 
found that for the power spectra used in this paper, its effect 
is to make the LOS correlation steeper when h is larger. 

To compare our results with observational data we take 
the data from Cristiani et al. (1997). The data consists of 
several QSO spectra at various redshifts, ranging from 1.7 
to 3.7. This range is pretty large, and evolutionary effects 
will be significant in the data. We compare the observed 
LOS correlation (points with error bars) with the theoreti- 
cal curve for the four cosmological models, and for various 
ranges of values of Tm and 7 in Figure ^ for z = 2.5. It should 
be noted that the observational data points were obtained 
using the Lya clouds with log(A''Hi/cm~'^)> 14. However, 
we have not used any such constraint while obtaining the 
analytical curves. As a preliminary check, we can see that 
the analytical curves have the broad features which are ex- 
pected from the observational data. We hope to carry out 
a more detailed comparison with observations in a future 
publication. 

We next check the redshift evolution of the LOS correla- 
tion function. For definiteness, we consider ^los at a partic- 
ular velocity, v = 100 km s~^, and study it as a function of z. 
We have assumed that 7 does not evolve with redshift. Since 
we are studying the evolution over a large redshift range, we 
have to consider both the possibilities for the evolution of 
Tm discussed in section ^. The evolution curve closely fol- 
lows a power-law dependence, i.e., ^los(z) oc (1 + z)~' in 
the range 1.5 < z < 4.5. We give the values of q and the rms 
error for different cases in Table |^. 

It is clear that the cosmology has maximum influence 
when 7 and Tm are small. The reason for this is same as 
that discussed in the case of Table ^ 
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Figure 2. The comparison of effect of 7 and Tm on the power law index of 5los(''')- The plots are for LCDMl model. We give the 
relative change in the power law index |Ap/p| for the two curves in each figure. In the left figure, xi, is kept constant. It shows the effect 
of changing the neutral hydrogen density fluctuations. The middle figure shows the effect of changing xi, without changing the neutral 
hydrogen density fluctuations (i.e., 7 is constant). The right figure shows the effect of 7, which affects both xi, and the neutral hydrogen 
density fluctuations. 
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Figure 3. Comparison of the theoretical ^los with observational data (Cristiani et al. 1997). The theoretical curves have been normalised 
in such a way that they match with the observed data point at the lowest velocity bin. In the left figure, the IGM parameters are fixed 
to be Tm = 20,000 K and 7 = 1.7. In the middle figure, the cosmological model is LCDMl, and 7 = 1.7. In the right figure, again the 
cosmology is LCDMl, and = 20,000 K. 



For a given cosmology and a given value of 7, the ef- 
fect of Tm (or, equivalently, Xb) on q is insignificant with the 
relative change in q being about 6-10 per cent. The reason 
for this is as follows: the effect of Xb is significant only for 
scales < Xb- For the parameters we are considering, the ve- 
locity scale corresponding to Xb is ~ 10-30 km s"\ Hence, 
for the case where v — 100 km s~^, the evolution will not 
be affected significantly by the Jeans length. We studied the 
evolution at a higher velocity scale {v — 250 km s~^), and 



found that the effect of Xb was even less (the relative change 
in q was about 3 per cent at w = 250 km s~^). 

The effect of 7 is threefold here - it affects the value 
of Xb, the evolution of Xb (more precisely, this also depends 
on whether we evolve Tm or not) and the neutral hydro- 
gen density fluctuations (through the value of j3). We have 
already seen that the effect of changing the value of Xb is 
not very important. Increasing the value of 7 will make the 
evolution of Xb more rapid. Since X h a, ppears in the denomi- 
nator of the integrand in equation (pl|), the higher the value 
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of 7, the more rapid is the decrease of ^los with increasing 
redshift. This feature alone will increase the value of q with 
increasing 7. But, actually q decreases when we increase 7 
because of the third effect of 7 - increasing 7 reduces the 
value of P (equation (|27|)), and j3 appears in the exponential 
in equation (^^. Hence, the ^los(2) curves will decrease less 
rapidly when P is small, i.e., 7 is large. 

Finally, we note that the effect of whether Tm evolves 
or not becomes appreciable for large values of 7. This is 
obvious because larger the value of 7, more rapid is the 
evolution of Tm- Furthermore, we have already argued that 
if the evolution of Tm is more rapid, then the value of q 
should increase (provided, of course, the value of P remains 
unchanged). Hence, the values of q are smaller when Tm is 
kept constant than when Tm has a redshift dependence. This 
can be verified from Table ^ 

These effects are shown in Figure ^ The curves are 
normalised in such a way that ^los('^ ~ 100 km s~^) = 
0.21 at 2 = 3.85, which is taken from Cristiani et al.(1997). 
In the top row of Figure ^, the value of Tm is fixed at a 
particular redshift (in this case, ai z — 2.5) and the value 
of Tm at other redshifts are calculated using the relation 
Tm ~ (1 + zf''-^. In the bottom row, Tm is kept constant. 
We do not plot the effect of Tm because we find it to be very 
weak. 

The point to be noted here is that the effects due to 
cfiange in cosmology and 7 are of the same order, wfiicli 
means that we cannot constrain both the parameters simul- 
taneously. Thus the above analysis clearly suggest that it 
will be very difficult to recover the power spectrum of density 
fiuctuations uniquely from the Lya absorption lines without 
knowing the IGM parameters. However, the cosmological pa- 
rameters determined through studies such as CMBR (and 
other data), can be used to constrain the equation of state 
(using the plot in the centre in Figure provided we have 
some idea about the evolution of the Jeans length or equiv- 
alently, Tm- 

Throughout this paper we have treated 7 to be indepen- 
dent of z. However, there are indications that 7 could change 
with z. Schaye et al. (1999b) notice that the temperature of 
the IGM has a peak at 2: ~ 3 and it decreases with decreas- 
ing redshift afterwards. They also notice that the slope of 
the equation of state become close to one at 2: ~ 3 then 
increases with decreasing redshift. Theoretical calculations 
suggest that 7 increases with time and the rate of evolution 
depends on the reionisation epoch (Hui & Gnedin 1997). 
From Table |^ and Figure ^ we can infer that when 7 be- 
comes larger the rate of growth of ^los at a given velocity 
decreases. Thus our study clearly suggests that the evolu- 
tion of ^LOS at a given velocity can be used as probe of a 
reionisation and thermal history of the IGM once the cos- 
mological model and the evolution of Xb is fixed. We hope 
to study this in detail in a future publication. 

4.2 Transverse Correlation 

In this section we present the results for the transverse cor- 
relation. As before, we consider the same CDM power spec- 
trum, and essentially the same range of the IGM parameters. 
The smoothing velocity is taken to be 30 km s~^, which is 
the typical peculiar velocity of a blob in the IGM. 

Given z, we calculate ^trans as a function of the trans- 



verse comoving distance l± . One can then convert this length 
scale to an angular scale 6 through the following relations. 

(65) 
(66) 



where Q,k is given by equation (^l|), x{z) is given by equation 
(S) and 




{sin r (if Qk < 0) 
r (if fifc = 0) 

sinhr (if Qk > 0) 



(67) 



Instead of plotting the correlation function directly, we 
plot the quantity VeiO), which is defined as follows. The 
excess probability, over random background, of finding two 
neutral hydrogen overdense regions separated by a comoving 
transverse distance l±_ is 



gtrans(^±; 2)27r/j_dZ± 

47r[dg°'"(2)]2 



(68) 



Using equation we get the excess probability over ran- 
dom background of finding two neutral hydrogen overdense 
regions separated by an angle 6 as 



Ve{e) d6» 



1 



Ctrans(e) 6 AO. 



(69) 



From Figure]^ it is clear that even the maximum excess prob- 
ability of finding two H I overdense regions over an angular 
scale greater than few arc seconds is less than 1 per cent. 
Observationally the distribution of H I along the transverse 
direction is probed by studying the common absorbers along 
the LOS towards closely spaced QSOs. The angular scales 
probed varies between few arc seconds and few arc minutes 
(Shaver, Boksenberg & Robertson 1982; Shaver & Robertson 
1983; Smette et al. 1992; Dinshaw et al. 1994; Bechtold et 
al. 1994; Crotts et al. 1994; Bechtold & Yee 1994; Smette et 
al. 1995; D'Odorico et al. 1997; Petitjean et al. 1998). Based 
on our analysis it is most likely that the common absorbers 
seen in the spectra of closely spaced QSOs are most likely 
probe the transverse extent of the same overdense region 
rather than the clustering length scale of separate regions. 

4.3 Difference between ^trans and ^los 

It should be noted that for a given mean redshift, the values 
of the LOS and the transverse correlation functions need 
not be the same. This is because, when we observe along 
one LOS, we actually sample different points at different 
redshifts. In contrast to this, the transverse correlation is 
calculated at the same redshift. The effect of evolution in 
the LOS correlation makes it different from the transverse 
correlation. 

To illustrate this point more clearly, let us first assume 
that Xb does not evolve with z. Then from equations (^^ and 
(p3|), we see that for a given length scale Z, the integrands in 
the two equations are identical. Hence, we get 



D{z + A2/2)T»(2 - A2/2) 



Qt 



D^{z) 



(70) 
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Figure 4. Clos(v=100 km s~^) as a function of z. The curves are normalised in such a way tliat 5los(v=100 km s~^) = 0.21 at 
2 = 3.85, which is taken from Cristiani et al.(1997). In the left plots, the IGM parameters are fixed Tm = 40, 000 K, 7 = 1.5. In the right 
plots, we have fixed the cosmological model to be LCDMl, and Tm = 40, 000 K. 



where z and Az are defined in equations ( |42|) and (|49| ) re- 
spectively. The diflterence in the two correlation functions is 
now entirely due to the evolution of the power spectrum. 
Thus the two correlation functions will be nearly equal for 
small Az but will start differing from each other for large 
Az. In the general case when xt, evolves with z, the difference 
will be much more prominent. 

This is indeed true, as one can see from Figure ^ For 
scales below 200 Mpc, the two correlation functions are 
nearly the same. But above such scales the two functions 



start differing appreciably. For the observations made in the 
scales of 10-100 h'^ Mpc, our analytical calculation shows 
that one should not see any appreciable difference between 
LOS and transverse correlations. This can be used as a im- 
portant tool determining the power spectrum (provided, of 
course, we know the IGM parameters and the correlation 
function completely). As we have argued earlier, one can- 
not get the power spectrum from the LOS correlation. But 
the power spectrum can be obtained from the transverse 
correlation in usual manner. Since the two correlations are 
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Figure 5. Plot of VgiO). Results for four different cosmological models and for two different redshifts are presented. The IGM parameters 
are 7 = 1.5,Tm(2 = 2.5) = 40,000K. Tm at redshift z = 3.5 is calculated using the relation Tm. ~ (1 + 2)^^"^. 




Figure 6. The ratio of ^los a^nd ^trans as a function of comoving scale /. The results for four cosmological models and for three different 
mean redshifts are plotted. The curves for the LCDMl, LCDM2 and the OCDM models nearly overlap. The model parameters relating 
to the IGM are 7 = 1.5, Tm (2 = 2.5) = 40, OOOK. T„i at the other two redshifts are calculated using the relation Tm ~ (1 + z)^T"3. 



identical for scales upto 100 h~ Mpc, one can start from the 
LOS correlation, replace it with the transverse correlation, 
and obtain the power spectrum. 



4.4 Column Density Distribution 

In this section we study the results for the column density 
distribution. As we have discussed in section ^, we shall con- 
sider two separate cases for the evolution of To and Tm, i.e., 
(i) (To and T^) cx (1 + zf'''^ and (ii) Tq and Tm are in- 
dependent of z. The comparison between the two cases is 
shown in Figure ^, where we plot the quantity f{Nm) for 
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Figure 7. Column density distribution of neutral hydrogen plot- 
ted for different kinds of evolution of Tm and Tq for two red- 
shifts. The thicker lines represent the case where both of them 
are constant. For definiteness we have taken Tm = 40, 000 K 
and To = 20, 000 K. The thinner lines are for the case where 
both the temperatures vary as (1 -|- z)^^~^. The values are 
Tmiz = 2.5) = 40, 000 K and To (2 = 2.5) = 20, 000 K. Obviously, 
the curves for the two cases overlap at 2 = 2.5. The cosmology is 
taken to be LCDMl, 7 = 1.7 and e = 0.3. 

the two different cases. /{Nm) is related to {dNpi^/dz dA^ni) 
through the relation (Bi & Davidsen, 1997) 

f{Nm) = (diVpk/d2 d7VHi)/(l + z). (71) 

As we can see from Figure |^ the difference between the 
two cases becomes more significant at higher column den- 
sities. We have checked and found that Tm has very little 
effect on the column density distribution. The difference be- 
tween the two cases at z — 3.5 is because of the fact that 
the value of To is different for the two cases at that redshift. 
However, we have already mentioned that any uncertainty 
in the knowledge of Tb can be compensated to some extent 
by changing e. Hence, for studying the column density dis- 
tribution we shall consider only the case where both Tm and 
To evolve as (1 + zf'''^. 

We shall now discuss the dependence of the column den- 
sity distribution on the following parameters : (i) cosmolog- 
ical models, (ii) e and (iii) 7. 

In what follows we try to get constraints on our model 
parameters using the observed column density distribution 
obtained from Hu et al. (1995) and Kim et al. (1997) at 
three mean redshifts 2.31, 2.85 and 3.35. In Figure H we 
have plotted the quantity f{Nm) for those redshifts. The 
observational data points are the points with errorbars in 
the figure. 

For all the plots in Figure H, we have fixed Tm{z = 
2.5) = 40,000 K and To{z = 2.5) = 20,000 K. In the left 
most panel of each row in the figure we plot the predicted 
column density distribution for various cosmological models 
for a given set of IGM parameters (7 — 1.2) and e — 0.3. 
We did not plot the LCDM2 model because it completely 



overlaps with the LCDMl model. In all the redshift bins it is 
clear that the SCDM curves fall steeply at the higher column 
density end compared to other models. This is consistent 
with the results noted by Gnedin & Hui (1996). However, 
in our method of obtaining f{Nm), the SCDM model can 
be made to fit the data by slightly increasing the value of e. 
The OCDM and LCDMl curves fit the observed distribution 
upto log(A''Hi/cm"2) = 14.2 for e = 0.3. 

In the middle panel of each row in Figure ^ we plot the 
predicted distribution for the three assumed values of e for 
LCDMl with 7 = 1.2. Increasing the value of e enhances 
the column density of a cloud (with fixed peak density), 
and consequently, the value of f{Nm) increases at the high 
column density region. It is clear that the observed distri- 
bution is consistent with e — 0.3. This means that, in the 
case of LCDM, the effective length of the overdense region 
is about (1/3) of the coherence scale R* . The value of _R* 
depends on the baryonic power spectrum, its typical value is 
of the order of few hundreds of Kpc; for LCDM model with 
7 = 1.2,r„(z = 2.5) = 40,000K, we get R* ~ GO/i"^ Kpc. 
Hence the typical length of an individual overdense region 
is ~ 20h~^ Kpc. Zhang et al. (1998) infer that the typical 
LOS scale length of an absorbing cloud is ~ 15-35 Kpc 
through their hydrodynamical simulations. Our values are 
consistent with theirs. 

The effect of 7 on the column density distribution can 
be seen from the right most panel in Figure ^ where we 
plot the results of LCDMl model for three values of 7. In 
the models with higher value of 7, the f{Nm) curve falls 
sharply at higher column density end at all redshifts, which 
is consistent with the conclusions of Hui et al. (1997). The 
reason for this sharp fall is because the neutral hydrogen 
density is less (for a given baryonic density) when the value 
of 7 is more and, hence there are less number of clouds with 
high column densities. At low redshifts, it is clear that lower 
values of 7 are preferred for e = 0.3. However, one can get 
the consistent fit for higher values of 7 by increasing the 
value of e. For example, at z — 2.31, one can fit the data 
reasonably well with e in the range 0.3 to 0.5 and 7 in the 
range 1.2 to 1.6. But it is impossible to fit the data with 
7 > 1.6, for any choice of e at z = 2.31. This implies that 
we cannot accommodate 7 larger than 1.6 at this redshift 
even by changing the values of To or other IGM parameters. 
At 2: = 2.85, one can fit the observations for 7 — 1.7 (the 
highest value of 7 we are considering in this paper) , by tak- 
ing e = 0.4. For the case where z — 3.35, we can see that 
we can marginally fit the observations for whole range of 7 
considered in this paper with e = 0.3 itself. 

Finally, we comment on how our results compare with 
some of the hydrodynamical simulations. It has been ob- 
served through numerical simulations that the peak bary- 
onic over-density and the neutral hydrogen column density 
are strongly correlated (Zhang et al. 1998; Dave et al. 1999). 
We tested equation ( p7[ ) against the scatter plots showing 
the correlations. The parameters were taken to be: LCDMl 
cosmological model with 7 — 1.2, e = 0.3 at z = 3. These 
are the parameters which fit the observations for the column 
density distribution reasonably well (see Figure |^ . It turns 
out that the relation matches well with the median of the 
scatter plot in Figure 8 of Zhang et al. (1998). Dave et al. 
(1999) give an analytical formula which relates ne [peak] /no 
and TVhi by a power- law (see equation (7)). We find that 
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Figure 8. /(A'^hi) as a function of A'^hi for three redshifts. We show the dependence of /(Afjji) on cosmology (left), e (centre) and 
equation of state 7 (right). For all the plots we have taken Tm(z = 2.5) = 40,000 K and To{z = 2.5) = 20,000 K. For the left plots, 
we fix 7 = 1.2, e = 0.3. We have not plotted the curve for the LCDM2 model because it completely overlaps with that for the LCDMl 
model. The centre plots are for the LCDMl model with 7 = 1.2. The right plots are also for LCDMl with e = 0.3. 



one needs a much higher value of 7 (~ 1.82) in our model 
to match the power-law index at redshift 3. Also, one needs 
a e ~ 1 to match the overall scaling factor. If we take such 
values of e and 7, then our results for the column density 
distribution will not be able to match the observations. We 
believe that such a discrepancy arises because of the differ- 
ence in methods used for assigning a column density to a 
cloud. 



We next test the column density distributions against 
some of the recent simulation results. In the column den- 
sity range lO^'^'^-lO^'*'^ cm~^, one can fit a power-law of the 
form /(iVni) oc N~^™ . The values of /3hi for our LCDMl 
model with 7 = 1.2 and e = 0.3 are 13m = 1.70 ± 0.02 
for z = 2.31, /3m = 1.65 ± 0.04 for z = 2.85, /3hi = 
1.56 ± 0.04 for z — 3.35. When compared with fits to 
the observational points, the corresponding power-law in- 
dices are 1.35 ± 0.03, 1.39 ± 0.26, 1.59 ± 0.13, respectively 
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(Kim et al. 1997). All errors given here are 2a. Simulations 
of Zhang et al. (1997) using the SCDM model produce a 
/3hi = 1.39 ±0.06 in the range 2 x 10^^ < Nm/cm~'^ < lO" 
for z = 3. Our curves are also in quite good agreement with 
the P^M-SPEl simulations (using SCDM model) by Theuns, 
Leonard & Efstathiou (1998) and Theuns et al. (1998) in 
the same column density and redshift ranges. More recently, 
Machacek et al. (2000) have performed hydrodynamical sim- 
ulations for various cosmological models. The LCDM model 
{Qm = 0.4, D.A = 0.6, h = 0.65, as = 1.0) with Hb/i^ = 0.015 
gives /3hi = 1.61 ± 0.04 for 2 = 2 and /3hi = 1.48 ± 0.04 for 
2 = 3 in the range lO^^ ** < iVni/cm"^ < 10^''-^. They note 
that the power-law index is higher for the SCDM model, 
and lower for the OCDM model. This is also consistent with 
what we get from our model. 



5 CONCLUSIONS 

We have presented a simple analytic expression for the cor- 
relation function and the column density distribution for 
the low H I column density systems seen in the spectra of 
high redshift QSOs. We have used our results to get con- 
straints on various cosmological and IGM parameters. We 
summarise our main results below. 

1. One cannot rigourosly define a power spectrum from 
the LOS correlation function. However, since the LOS and 
transverse correlations are identical below scales of ~ 100 

Mpc, it is possible to obtain the power spectrum by a 
Fourier transform of the LOS correlation function (provided 
the IGM parameters are known). 

Previous studies have attempted to recover the power 
spectrum of density fluctuations from the observations of 
the IGM (Croft et al. 1998, Hui 1999, Croft et al. 1999). We 
show that it is difficult to recover a unique power spectrum 
from H I correlation function without constraining the IGM 
parameters, especially 7. We have shown that the shape of 
the LOS correlation function at a particular mean redshift 
becomes less and less sensitive to the DM power spectrum 
as the universe evolves after reionisation. We feel that the 
correct approach in studying this issue is to constrain the 
cosmological models using CMBR or supernovae data, and 
apply those constraints to study the IGM parameters using 
H I correlation functions. 

2. The LOS correlation function and its evolution is 
much more sensitive to 7 than Tm- Using observations which 
give the shape of the LOS correlation at a particular redshift, 
one can constrain the value of 7, even with ill-constrained 
values of Tm, provided the background cosmology is known. 
However, the more uncertain is the value of Tm, the less 
constrained is 7. Carrying out such an exercise for differ- 
ent redshifts, it will be possible to constrain the evolution 
of 7. However, for this study, one needs accurate observa- 
tional data at different redshift bins which are not affected 
by evolutionary effects. Thus one can use the study of cor- 
relation function as an independent method to constrain the 
reionisation history of the universe. 

3. The analytic column density distribution for H I, like 
Clos, is less sensitive to Tm than 7. The distribution, when 
compared with observations, favours a lower value of 7, al- 
though at redshifts > 2.5, one can marginally fit the obser- 



vations with higher values of 7. Our model clearly rules out 
7 > 1.6 at redshift 2.31. 
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APPENDIX A: DETAILED CALCULATIONS 
FOR THE COLUMN DENSITY DISTRIBUTION 

Suppose we are looking at the IGM along any one direction, 
at some redshift z. Then the linear density field 5^'^\x,z) 
along that axis will be described by a one dimensional Gaus- 
sian random field, with a power spectrum 



{k,z) = —D\z) 



dk'k' 



{l + xl{z)k^)^ 



(Al) 



From now on we shall derive all the expressions at a 
particular redshift z, and we shall not write the explicit z- 
dependence on the quantities. 

To define the column density, we associate each local 
maximum or peak in the linear density field to a Lya cloud. 
The column density corresponding to such a cloud is given 
by equation ( |56| . We have expressed S^^^ [peak] in terms of 
A'hi in equation ([H^). 

Using the properties of a Gaussian random field, we 
can derive the joint probability distribution for the three 



Gaussian random fields S 



(ID) .(ID)// o{lI3)/ 



The probability 



that the field and its second derivative have values S 



(ID) 



and 



respectively at the peak 5 



(ID)/ 



is 



exp 



01 d5^ 



(l-D) (J^(ID)" ix{l-D)// 



dx • 



(27r)3/2criE 
, 2r(lD)/;2 
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2S2 



f 2 dioy 



d^i^) dSi'^'>" \Si'^^"\ dx, 



where 



2^ 



dk k^^^P, 



2m r,(lD) 



{k,z) 



and 

v^2 2 2 

2j = (Jq(J2 



4 

o-i. 



(A2) 



(A3) 



(A4) 



Note that cjq = A, defined in equation (g|). 

For convenience, let us define some dimensionless quan- 
tities 



.{ID), I 



CTO 



cr2 



O"ocr2 



(A5) 



u and A measure the field and its second derivative, respec- 
tively; K is a measure of the width of the power spectrum. 
One can use these quantities to obtain the number of peaks 
(clouds) per unit length 



diV, 



pk 



da; 



-prr(lD) c(lD)// c(lD); 



01 d5^^°'> d5^^""' |5^^^^"|(A6) 



,(1D)// |^(1-D)"| 



Using equation (|39|), one can convert the above expression 
to the number of clouds per unit redshift interval. After 
simplification, the relation becomes 



dz (27r)3/2Vl-K2 7^* 
1 f (ly-nXf 



exp 



A dA df 



2 [ l-K- 

The A integration can be carried out to obtain 



(A7) 



diV. 



pk 



dniz) 



dz dp (27r)3/27?* 



K exp 



erfc 



2(1 -k2) 



-1/^/2 



2 --\^ 
We are interested in the quantity 



diVpk 



diVpk du 



dz dNni dz dv dNni 

which is straightforward to obtain from equation 
vided we know i/ as a function of A^hi- Equations 



(A8) 



(A9) 




(A5) give V in terms of A'hi, and they can be used to calcu- 



late dv/dNm (for nni <^n^) 
du 1 



diVni PANni 



(AlO) 



Thus we get an analytic expression for the number of clouds 
per unit redshift interval per unit column density range 
(dApk/dz dAm) as a function of Am. 
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